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Abstract 

We consider the Steiner tree problem in quasi-bipartite graphs, where no two Steiner vertices are 
connected by an edge. For this class of instances, we present an efficient algorithm to exactly solve 
the so called directed component relaxation (DCR), a specific form of hypergraphic LP relaxation that 
was instrumental in the recent break-through result by Byrka et al. Q. Our algorithm hinges on an 
efficiently computable map from extreme points of the bidirected cut relaxation to feasible solutions of 
(DCR). As a consequence, together with |2 | we immediately obtain an efficient 73/60-approximation 
for quasi-bipartite Steiner tree instances. We also present a particularly simple (BCR)-based random 
sampling algorithm that achieves a performance guarantee slightly better than 77/60. 

1 Introduction 

In the Steiner tree problem, we are given an undirected graph G ~ {V, E) with costs c on edges and its 
vertex set partitioned into terminals (denoted R QV) and Steiner vertices {V \ R). A Steiner tree is a 
tree spanning all of R plus any subset ofVQR, and the problem is to find a minimum-cost such tree. 
The Steiner tree problem is APX-hard, thus the best we can hope for is a constant-factor approximation 
algorithm. In particular, the best inapproximability known (assuming P ^ NP) is 1.01063 (> ||) due 
to Chlebik and Chlebikova |6|. For the special family of instances that are known as quasi-bipartite 
graphs, which is the subject of our work, the best hardness known is 1.00791 (> jp) |l6l, under the 
same complexity assumption. 

In a recent break-through paper, Byrka, Grandoni, RothvoB and Sanita ||2l|3] presented the currently 
best approximation algorithm known for the problem. The algorithm has a performance ratio of ln(4) + e 
for any fixed e > 0, and it iteratively rounds solutions to a so called hypergraphic linear program. 
Such LPs commonly have a variable xk for each K C R, representing a full component spanning the 
terminals of K. A full component is a tree whose leaves are terminals and whose internal vertices are 
non- terminals. 

Figure [T](i) shows an example of a full component spanning a set of terminals (squares). There are 
several equivalent hypergraphic LPs f4l; here we focus on the directed component relaxation (DCR) that 
was first introduced by Polzin and Vahdati Daneshmand 1 10] , and then later used by Byrka et al. IJIEI- 
We will now describe this LP. 

Given a full component K and one of its terminals u, we obtain a directed full component by ori- 
enting all of X's edges towards the sink node u. Vertices v ^ K \ u are called sources; an illustration 
is given in Figure [T](ii). Note that when there are no Steiner-Steiner edges, i.e. when the instance is 
quasi-bipartite, then every full component is associated with only one Steiner vertex which we call the 
centre of the full component. 

In the following, we let K, denote the set of all directed full components, and for K E K,, we let 
sink(K) be the sink node of K. We will sometimes abuse notation, and use K £ K, for the set of arcs of 
the corresponding oriented full component, and for the set of terminals it spans interchangeably. We use 
Cfc for the cost of the full component K. For a set L/ C R, we let A+([/) denote the set of components 
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(i) 



(ii) 



Figure 1 : Example of a full components and a directed full component. 



K ^ JC whose sink lies outside U and that have at least one source in U . In this case, we will also say 
that K crosses U . We also use x{S) as a short for ^ 



(DCR) has a variable for every K £ K., and 
a constraint for every set [/ C R\r, where r G i? is an arbitrarily chosen root node. In the following 
we say that U CV is valid if it contains at least one terminal, but not the root. 



min ckXk 
KeK 

s.t. x(A+(C/)) > 1 VC/ C i? 
a; > 



(DCR) 



min ^ CaXa 

S.t. x{5+{U))>l V valid C/cy 
X > 



(BCR) 



Goemans et al. fSl] recently showed that solving ( |DCR| l is strongly NP-hard. Nevertheless, for any 
fixed e there exist an efficient (1 + e) -approximation for the value of ( |DCR| l in the following sense: Let 
(DCR(fc)) be the version of ( |DCR| l that omits variables for full components with more than k terminals. 
Borchers and Du |T| showed that the optimum value of (DCR(fc)) is larger than that of ( |DCR| l by at most 
a factor pk where 

{t + 1)2* + s 



Pk = 



i2* 



where we let i G N and s < 2* such that A; = 2* + s. Byrka et al. ||2||3J compute such an approximate 
solution of ( |DCR| l, and the performance guarantee of their algorithm is pk • In 4; for every e, the value k 
can be chosen large enough such that this is at most ln(4) + e. One easily sees that already for moderately 
small values of e, large values of k need to be chosen. E.g., for • ln4 to be smaller than 1.6, we need 
k to be bigger than 90 (compare this to In 4 < 1.39). For such values of k, solving (DCR(fc)) becomes 
a challenge since even compact reformulations of (DCR(fc)) 121131 have 0{n^) variables, and equally 
many constraints. 

In this paper, we study the bidirected cut relaxation ( |BCR| l Q. In this relaxation, we convert first 
the original instance G — (V, E) into the digraph D = {V, A), where A has arcs (u, v) and {v, u) for 
every edge uv G E; both arcs have the same cost as uv. We once again pick an arbitrary root terminal 
r G R, and call asetU C V valid if it contains terminals but not the root. ( |BCR| l has a variable for every 
arc in A, and a constraint for every valid set. 

Despite the fact that this relaxation is widely considered to be strong, its integrality gap is only 
known to be at least 36/31 ||3l, and at most 2. The known lower and upper bounds on the integrality gap 
of (|DCR|, on the other hand, are 8/7 [W] and ln(4) |8|. 

In this note we focus on the class of quasi-bipartite Steiner tree instances - instances, where no two 
Steiner nodes are connected by an edge. Our main result for such instances with n many vertices and m 
many edges is the following. 

Theorem 1. For quasi-bipartite Steiner tree instances, ( |DCR| l can be solved exactly using 0{mn^) 
minimum s, t-cut computations in graphs with 0{mn) vertices. 
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We accomplish this by solving ( |BCR| l, and by giving an efficient decomposition algorithm that maps 
the given minimal ( |BCR l solution to one of ( |DCR| l. We note that Chakrabarty et al. had previously 
shown that ( |BCR| l and ( DCR| l have the same optimal values in quasi-bipartite graphs. The proof in L4J 
uses "dual" arguments, however, and it is not clear how to obtain a "primal" algorithm. 

The above theorem has a couple of consequences. First, we can use it together with ||2l[3] to obtain 
an efficient 73/60-approximation for quasi-bipartite Steiner tree instances. We also obtain a slightly 
weaker 1.28-approximation that uses a particularly simple sampling strategy based on ( |BCR[ ). 

We remark that Goemans et al. fSl have recently obtained an alternative proof of Theorem [T] The 
work presented here was obtained before [SJ appeared on the arXiv, and is therefore independent. 



2 Decomposing (BCR) extreme points 



In this section we provide a proof of Theorem [T] In the following fix a quasi-bipartite instance of the 
Steiner tree problem. Let G — (V, £') be the input graph, R C V the set of terminals, and Cg a non- 
negative cost for each of the edges e ^ E. Also let D = (V, A) be the digraph obtained from G by 
replacing each edge e — uvhy two arcs {u, v), and {v, u) each having cost Cg. We choose a fixed root 
node r ^ R, and call asetU CV valid if it contains some terminals, but not the root. 

Let y be a feasible solution for ( |DCR| l. We define the following natural map from the space M'^ to 



K 



where xk is the characteristic vector of the arcs of full component K. The proof of the following 
observation is straight forward, and makes use of the fact that a full component crosses a valid set U 
only if at least one of its arcs does. 

Observation 2.1fy is feasible for ( |DCR| l then is feasible for ( |BCR| l. 

Notice that $ is cost-preserving, and it therefore follows immediately that the optimum solution 
value of ( |BCR| l is at most that of ( |DCR| l. In order to prove Theorem [T] it suffices to show that, in the 
case of quasi-bipartite graphs, the optimum of ( |DCR| l is at most the optimum of ( |BCR[ ) as well. We 
accomplish this by showing that, given a minimal solution x of ( |BCR| l, we can efficiently find a minimal 
solution y of ( |DCR[ ) such that $(?/) = x. We start by giving an overview of the proof. We define the 
following polyhedron: 

I := {{x,y) eRi xR'^ : x{S+{U)) +yiA+{U)) >1, V valid C/cy}. (1) 

Clearly, if x is feasible for ( |BCR| l then {x, 0) € I. Call a full component K E K, feasible with respect to 
(x, y) E I if we can shift fractional A-weight from the arcs of K to the full component K. Formally, K 
is feasible if 

{x - X ■ XK,y + >' ■ bk) El, (2) 

for some A > 0, where ck is the standard orthonormal vector indexed by full components in K,. Our 
first goal then is to show in Section 2.1 that a feasible component always exists. Then in Section 2.2 we 
show how to efficiently compute such a feasible component K, which allows us to find in Section 2.3 the 
maximum A corresponding to K such that (|2| holds. Our strategy then is self-evident. Starting with the 
initial feasible vector {x^, — {x, 0) to I, we define a sequence of values A^, A^, ... as above, giving 
rise to a sequence of feasible vectors (x*, i/*) = — A' • xk, V^^^ + ^* ■ ^k) to I, where K is the 



full component corresponding to the value A' . Finally, in Section 2.4 we argue that the sequence above 
converges in polynomial many steps into a feasible vector (0, y) to I. Since the weight shifting at every 
step preserves the total cost, our main theorem follows. 

We now fill in the details, and begin with a few existential results. Subsequently, we show how to 
obtain a strongly polynomial decomposition algorithm. 
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2.1 Existential results 



In the following it will be convenient to study slight generalizations of ( |BCR| l and ( |DCR[ ). Let / be an 
intersecting supennodular function defined on subsets of terminals; i.e., we have 

f{A) + f{B)<f{AC^B) + f{Ai^B), 



for any A, B C i? with A n B ^ 0. We then obtain the LPs ( |BCR/| l and ( |DCR/| l by replacing 1 by / on 
the right-hand sides. 



min ckxk (DCR/) 

s.t. x(A+(C/)) > /(t/) VC/Ci? 
a; > 



(BCR/) 



S.t. x{5+{U))> f{Ur\R) V valid C/CF 



Chakrabarty et al. [4| showed that the optimal values of (jBCRjl and ( DCR| l coincide for quasi-bipartite 
instances. It is an easy exercise to see that their proof extends to ( BCR/| l and ( DCR j 
alternate proof of this fact in the appendix. 



We provide an 



Theorem 3. The optimal values of ( BCR / i and ( DCR / i coincide for quasi-bipartite instances, non- 
negative (not necessarily symmetric) costs c, and intersecting supennodular function f. 

The following is now an easy corollary. 



Lemma 4. IfG is quasi-bipartite, f is intersecting supennodular, and x is an extreme point of (BCR/ 
then there exists an extreme point y of (DCR/ I such that <!>(?/) — x. 



Proof. By the theory of linear programming, there is c e R-^ such that x is the unique optimal solution 
of ( |BCR/| . Since the feasible region of (BCR/ i is upward-closed, we have c > 0, for otherwise x would 



obtain a feasible solution to (DCR/ 



Since (DCR 



By Theorem 3 



not be optimal. We claim next that ( DCR / i is feasible. Indeed, since ( BCR / i is feasible, we know 



f{R), /(0) < 0, and hence ( DCR| l is feasible. Any feasible solution to ( |DCR| i can now be scaled to 



is feasible and c > 0, we may let y be an optimal extreme point solution to ( DCR / 

^x since $ preserves cost. 



T 

y = c X. 



Let X — $(?/), and observe that c 
Observation [2[applies also to (|BCR/[ ) and ( |DCR 
unique optimal solution to (BCR/ i for costs c, we must have x 



T ~ 
X 



T 

c y 



and shows that x is feasible for (BCR/ 



As X is the 
X, and this completes the proof. □ 



The rest of this section focuses on making the above existential proof constructive. In the following 
we once more abuse notation, and use S^{U) (A+(C/)) as the incidence vector of arcs (full components) 
that cross valid set U; (5+([/), and Aj^([/) then denote the component of this vector corresponding to 
arc a and full component K, respectively. We obtain the following plausible lemma. 

Lemma 5. For every K £ IC, is submodular i.e. ifU. W C R, then 

A+{U) + A+{W) > A+ ([/ r\W) + A+{U U W). 

Proof. We proceed by case analysis. If the right-hand side is zero, the inequality is trivial. Case 1: 
Suppose A;^(C/ r]W) = I and A^(J7 U M^) = 0. Then, without loss of generality, the sink of K lies 
outside U. Since K has a sink in t/ n W, in particular in U, this implies A^(?7) = 1. Case 2: Suppose 
A'^{U n W) = and A^(C/ U W) = 1. Then, without loss of generality, K has a source inside U. 
Since the sink of K lies outside U U W, in particular U, this implies Aj([/) = 1. Case 3: Finally, 
suppose A'^{U n W) — 1 and Aj([/ U W) = 1. Then K has a source inside U r\W and its sink lies 

■ ([/) = A+{W) = L □ 



outside both U and W, so A^ 



The following lemma shows that a ( |BCR| l extreme point can indeed be decomposed iteratively into 
fuU components. 
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Lemma 6. Let G be quasi-bipartite, let f be intersecting supermodular, let x be a minimal feasible 
solution of (BCR/ i and let vu & Abe such that v ^ R,u € R, and Xy^ > 0. Then there exists A > 
and K G K, with vu G K such that 

X ■.= x- XxK (3) 

is minimally feasible in (BCRf ) where f is obtained from f by reducing f{U) by Xfor all valid U that 
are crossed by K. Moreover, for any such A, /' is again intersecting supermodular 



Proof. As X is a minimal feasible solution to (BCR/ 



we can write it as 



i=l 

where x^, . . . , x*^ are (BCR/ i extreme points, a > 0, and \^ a ~ 1. Since x^u > 0, xl^^ > for some 
j. By Lemma|4] for every i there exit such that — and is feasible to (DCR/ I. Let K E K, 

be any component with vu g K and y]^ > 0. 

Clearly, y = X^iLi '^iV^ ^ feasible point of (DCR/ i. Now obtain y' by reducing the Kth compo- 
nent of y^ in the above convex combination to 0; i.e., let 



y 



Let f — f — ajy-'j^A^; i.e., we obtain /' from / by reducing f{U) by ajt/j^ if K crosses U. Note that 
/' is intersecting supermodular as / is intersecting supermodular, and is submodular 

Simply from the definition of /' it now follows that y' is feasible for (DCRy), and Observation |2] 
shows that x' — ^{y') is feasible for (BCR//). From the definitions of $ and y' it also follows that 
x' = X — ctjy'^j^XK, and we therefore choose A — ajy''i( in 

Finally, suppose for the sake of contradiction that x' is not a minimal solution to (BCR//); i.e., there 
is an arc a and e > such that x" — x' — tea is feasible for (BCR//). Then 

x((5+(C/)) - A|{a : a e A' n (5+(C/)}| - eC(C/) = 

a;'(<5+(C/))-6e(C/) > /'(C/ni?), 

for all valid U CV. Thus, we have 

x{S+{U)) - e5+{U) > f{U n i?) + \\{a : a e K D S+{U)}\, 

for all valid U. Note that the right hand side of this inequality is at least f{U n R) as Af^{U) < 
Xk(S^{U)) for all U. Thus, x is not a minimal (BCR/ i solution, and this is the desired contradiction. 

□ 



2.2 Towards efficiency I : Finding feasible components 

What conditions are sufficient for a full component K to be feasible? Well, certainly we need Xa > 
for all a £ K. Beyond this, feasibility is characterized by tight valid constraints. We say that valid set U 
is tight for {x, y) if the corresponding constraint in I is satisfied with equality. It is easy to see that K is 
valid iff every tight set crossed by K is crossed by at most one of its arcs; i.e., 

A+ ([/) > 5+{U) (4) 

aeK 

holds for all tight sets U . In fact, it suffices to look at certain tight sets. 

In what follows, fix a full component K along with its centre v and sink u. Figure |2](i) shows a tight 
set G that contains both the sink u and a source terminal w but not the centre v. In this case, the arc 
(ly, v) crosses C, but K does not, and Q is violated. We let C be the set of neighbours of v that don't 
lie in such a tight set, and are hence eligible source nodes: 

C = {w £ T{v) \u : $ tight vahd set U with u,w e U and v ^ U}. 
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(i) 



(iii) 



Figure 2: Three classes of tight sets. 



Figure [2] (ii) shows a tiglit set X that does not contain the centre v nor the sink u of K, but two sources 
w and z. Since two of K's arcs cross X, component K is once again not feasible. A feasible component 
may contain at most one source from a tight set X like this. We let X be the set of all eligible source 
node sets contained in such tight sets: 

X ^ {X nC : X is a tight vaUd set with u,v ^ X}. 

Finally, Figure |2] (iii) shows a set Y that contains K's centre but not its sink. In this case, K must contain 
one of its sources in Y as otherwise K would not cross Y. We let y be the set of all eligible source node 
sets contained in such tight sets: 

y ^ {Y nC : y is a tight vaHd set with v eY,u 1^ Y}. 

Lemma 7. A component K with centre v and sink u is feasible iff (a) Xa > 0/or all a G K, (b) sources 
of K lie in C, (c) K has at most one source in each set in X, and (d) K has at least one source in each 
set in y. 

Proof If K is feasible then clearly (a)-(d) above needs to be satisfied. We prove the converse. 

Suppose that (a)-(d) are satisfied for some full component K with centre v and sink u. Since Xa > 
for alia G K it suffices to check that Q holds for all tight vaUd sets U. 

Consider a particular tight valid set U, and suppose first that K crosses U; i.e., K has its sink outside 
U, and at least one of its sources is in U. Then Q is satisfied if S^{U) has at most one of K's arcs. 
Suppose for the sake of contradiction that S^{U) has more than one arc from K. In this case, v ^ U, 
and U n sources(i4') £ X. But in this case, (b) implies that K can have at most one source in [/; a 
contradiction. 

Now suppose that K does not cross U, and assume for contradiction that S^{U) has some of K's 
arcs. Assume first that {v, u) e 6^{U). In this case, U n sources(ii') G 3^, and hence, by (d), K must 
have a source in U, and therefore K crosses U ; a contradiction. Now assume that some arc {w, v) G K 
crosses U. In this case w is a source of K, and u must be in U as otherwise K would cross U. But this 
means that w ^ C, and we arrive yet again at a contradiction. 

Thus, K satisfies the condition in Q for all tight sets U. □ 

We need the following standard uncrossing lemma. 

Lemma 8. Let S,T QV be tight such that 5' H T H i? 7^ 0. Then S HT and S UT are also tight valid 
sets. 

Proof Since S" n T n i? 7^ 0, 5 n T and S" U T are vaUd, and hence 

2-y(A+(5))-y(A+(T)) - x{5+{S)) + x{S+{T)) 

> x{5+{SnT))+x{S+{SUT)) 

> 2-y{A+{SnT)-ylA+{SUT)) 

> 2-2/(A+(5))-y(A+(r)), 



6 



where the first inequality uses the submodularity of x{S'^{-)), the second inequality follows from feasi- 
bility of the constraints in X, and the last inequality uses Lemma |5] It follows that all inequalities above 
hold with equality. □ 



The last puzzle piece needed before we present an algorithm to find feasible components is the 
following structural fact. 

Lemma 9. X and y are closed under intersection and union. 

Proof. Suppose Xi,X2 G X and Xif] X2 ^ 0. Then, for « e {1, 2}, there is a tight valid set Ui that 
does not contain v and u, and = [/^ n C. Clearly, Ui n U2 and Ui U U2 are also valid, and they are 
tight by Lemma[8] Neither Ui n U2 nor Ui U U2 contain u and v. Thus C/i n C/2 n C = n and 
J7i U J72 n C = U X2 are also part of X. 

Similarly, if distinct Yi, I2 G y intersect then, for i E {1,2}, there exists a tight valid set Ui with 
u^Ui,v eUi and Y^ = Uin C. So by Lemmajs] U' ^UiH U2 and U" = C/i U f72 are tight valid sets 
as well. Both U' and U" contain v and not u. Therefore C/' n C = Yi n 1^2 and f7" n C = Yi U Y2 are 
also sets of y. □ 

Lemma |9] allows us to slightly refine the conditions in Lemma |7] Let us define X* to be the set of 
inclusion-wise maximal sets in X, and let y* be the inclusion-wise minimal elements of y. Then we 
may replace in the statement of Lemma |7] the X in (c) by A"*, and the y in (d) by 3^*. Moreover, the sets 
X* ,y* along with C can be found in polynomial time as we explain in the next lemma. 

Lemma 10. For {x,y) £ I, the sets C,X* ,y* can be found using 0{n) minimum-capacity s,t-cut 
computations. 

Proof Let D be the digraph that has node set 

V U {s} U {vk '■ K full component with uk > Q}- 

D has an arc for each arc a in the support of x\ the capacity of this arc will be Xa- For each full 
component K in the support of y, we add K's, arcs, using node vk instead of K's, real centre v. The sink 
arc [vk, u) has capacity uk, and all source arcs have infinite capacity. We will augment this graph and 
specify capacities in order to find C, X* , and 3^*. 

In order to determine whether w £ r(t;) \ u is in C we need to check whether there is a tight valid 
set U that contains both u and w but not v. We obtain the graph Z?^ from D by adding arcs (s, w) and 
(s, u) of infinite capacity. We also assign infinite capacity to arc {v, r). Using the feasibility of {x, y) for 
I it follows that any cut separating s and r has capacity at least 1. Furthermore, the minimum-capacity 
such cut has capacity 1 if and only if w ^ C. In order to compute C it suffices to check all w e r(i>) \ u. 

The strategy to find X* is very similar to the above procedure for determining C. For each terminal 
w E C w& find, if it exists, a inclusion-wise maximal vahd set U containing w but not u and v. In order 
to do this, we obtain graph D'^ from D by adding arc (s, w) of infinite capacity, assign infinite capacity 
to arcs {v, u), and {u, r). Once again, the minimum s, r-cut in this graph has capacity at least 1, and it 
is exactly 1 if X has a set containing w. In the latter case, it suffices to compute a maximal min s, r-cut 
in this graph, and include it in X* . After having done this for all w E r(w) \ u, and after deleting all 
non-maximal sets, Lemma|9]implies that X* is a family of pair-wise disjoint sets. 

Finally, in order to compute y*, we create the following graph for every w E C: add two arcs 
(s, w), and (s, v) of infinite capacity to D, and assign infinite capacity to (u, r). Once more by feasibility, 
a maximum s, r-flow in this graph has value at least 1, and value exactly 1 if there is a J^-set containing 
w. In the latter case, we compute an inclusion-wise minimal mincut and add its intersection with C to 
3^*. We repeat the procedure for all w E C. By Lemma|9j the family y* contains pair- wise disjoint sets, 
once we clean up by deleting all non-minimal sets. 

Finally, note that in all cases above, we perform n many mincut computations. □ 

We are now ready to show how to efficiently find a feasible component. 
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Lemma 11. Let (x, y) G T, and suppose that there is a feasible component. Then there is an algorithm 
to find such a component that runs in time 0{nT„ic), where t„ic is the time needed to find a minimum- 
capacity s, t-cut. 

Proof. Choose a Steiner vertex v, and sink node u such that x^u > 0. We know from Lemma[6]that there 



is a feasible component K with centre v and sink u. By Lemma 10 the corresponding sets C, X* ,y* 
can be computed in time 0{nTmc)- We can then find a feasible component with centre v and sink u by 
computing a max flow in a bipartite auxiliary graph. Introduce a vertex x for every set X ^ X* , and a 
vertex y for every Y ^y* . Add an arc (x, y) if the corresponding sets X and Y share a terminal from C. 
Also connect each of the y* nodes to a sink node t, and give each of these arcs unit capacity. Similarly, 
introduce a source node s, and connect it to all X* nodes via unit-capacity arcs. Observe that a maxflow 
of value 1 3^* I exists iff there is a feasible component with sink arc (w, w). Let h be such a maximum flow, 
and let S be the set of terminals corresponding to edges {x, y) with hxy = 1. It follows from Lemma|7] 
that 

{{w,v) : w e 5} U {{v,u)} 
is a feasible full component. □ 

2.3 Towards efficiency II : Finding the step weight A 

In this section we assume that we have a minimal feasible point (x, y) G /, and a feasible component K. 
The following lemma estabhshes that we can find the largest A such that 

(a;^, y^) [x - XxK,y + Ae/f ) 

is in I. 

Lemma 12. Given a minimal feasible point (x, y) e I, we can find the largest A such that {x^, y'^) is 
feasible for T. Our algorithm runs in time 0{n'^Tmc)- 

Proof. Let us first choose A*^ = miua^K Xa\ clearly, a larger value of A would result in some negative 

A X , 

X variables, [x , y ) may still not be feasible, and violate some of the valid cut inequalities. We now 
look for a valid set U that is violated the most. 

Once again this is accomplished by min s, r-cut computations in a suitable auxiliary graph. Do the 



following for each w € R. Start with the graph D used in Lemma 1 1 Let the capacity of every arc 
a G Ahe x^ , and let the capacity of arc Vk,uhe y^' for all iiT e /C. Finally add an arc {s,w) of infinite 
capacity. If (x^ , y^") is feasible then the max s, r-flow in this graph is at least 1. If it is lower, let Uw 
be the vertex set corresponding to a minimum s, r-cut. 

Among all the sets Uw found this way, let be one of minimum capacity. Choose A^ < A° such 
that (x^ , ) satisfies the cut constraint for set JJ". The new point (x^ , y^ ) may still not be feasible. 
There may be a valid set U that is violated by this point. As a function of A, the violation of the constraint 
for set U is 

hu{X) = (1 - x{6+{U)) yiA+iU))) A(|^+(f/) nK\- A+ ([/)), 

where, we recall, |(5+(C/) n K\ is the number of arcs in K that cross U, and A'^{U) is 1 if K crosses U, 
and otherwise. Call the coefficient of A in the above expression a{U), and note that it is an integer 

Recall now that we chose U'^ as the valid set with maximum violation. The fact that is not 
violated by A\ but U means that a{U) < a{U°). In fact, all vaHd sets [/' with a{U') > a(C/°) are 
satisfied by (x'^ ,y^ ), following the previous argument. 

Note that a{U) is at most n, and non-negative. We continue in the same fashion: for (x^ , ) we 
look for a valid set that is maximally violated, and choose A'^ < A^ largest so that this set is satisfied. 

This produces a sequence of A's and corresponding valid sets 

C/0,C/\C/2,..., 

such that a{U°) > a{U^) > a{U^) > Clearly, this process has to terminate within in n steps. □ 
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2.4 Efficiency: Putting things together 
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We are now ready to state the entire polynomial-time algorithm for computing the decomposition of a 
minimal ( |BCR| l solution x. 

Algorithm 13. Decompose 

Require: a; is a minimal feasible solution of ( |BCR| l. 
Initialize y eRfto 0. 

while there isvu^A with v ^ R,u G R, Xyu > do 
while Xvu > do 

Find a feasible component K E JC, with centre v and sink u. (Lemma 
Find the greatest A > such that {x^^ y^) € I. (Lemma 
Set {x,y) to {x^,y^) € I. 
end while 
end while 
return y. 

Our algorithm maintains as an invariant that (x, y) is a minimal feasible point in I. Note that Lemma 
|5]implies that the function / defined by 

/([/) = l-y(A+(;7)) 

for all valid U C Ris intersecting supermodular. Lemma |6] then guarantees the existence of a feasible 
component K E JC. LemmafTS] implies that the above invariant is maintained throughout. It remains to 
show that steps|4]-[6]are executed a polynomial number of times. 

Call a step saturating if the support x decreases; i.e., some arc variable Xa is decreased to 0. Obvi- 
ously, the number of such events are upper bounded by 0{m), where m is the number of edges in the 
original graph G. 

Let us focus on non-saturating steps. Let {x, y) be the point in I in step|4] and let K be the full 
component chosen. We find A in step p] and note that the supports of x and x'^ have the same size. The 
increase of A is thus deteimined by some valid set U as follows: U is non-tight for (a;, y) and tight for 

{x^,y^). 



Our choice of K implies (see also Lemma 1 1 1 that sets U that are tight for {x,y) are also tight for 



{x ,y ). K is certainly not feasible for {x\ y ), and hence, once again by Lemma 11 at least one of C, 
X* ,oxy* must have changed. 

As a set U that is tight for {x,y) is tight also for {x^,y^), C can only shrink, and the number of 
times this can happen is clearly bounded by n. Similarly, the new sets X, and 3^ are supersets of their 
old counterparts. 

Focus on X, and let C and C be maximal laminar families in X for {x, y) and [x^^ y^), respectively. 
The set X* precisely consists of the maximal sets of £. If X* changes then this means that the set of 
maximal sets in laminar families C and £' differ. This can happen only for one of two reasons: the sets 
in C cover more terminals than those in £, or two maximal sets in C are now part of the same maximal 
set in C . Clearly, the number of such events is bounded by 0(|i4'|) — 0{n). 

The argument for y is similar, and we omit it here. In summary we have proved the following: 

Lemma 14. Between any two saturating steps, the algorithm performs at most 0{n) non- saturating 
ones. Thus the total number of times steps^-^of Algorithm 13 are executed is bounded by 0{mn). 



Note that this means that at most 0{mn) full components are added throughout the algorithm, and 



that the auxiliary graph used in the mincut computations in Algorithm 13 has at most 0{mn) nodes. 
This proves Theorem[T] 



3 An application: Sampling without decomposition 

In this section, we employ the existential result given in Lemma |4] to give a compact and fast imple- 
mentation of a recent (DCR)-based LP-rounding algorithm (henceforth referred to by CKP) given by 
Chakrabarty et al. i5J for the case of quasi-bipartite Steiner tree instances. 
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We first review the algorithm CKP in the special case of quasi-bipartite Steiner tree instances. Given 
such an instance, CKP first solves ( |DCR[ ); let y be the corresponding basic optimal solution, and let 
M = l^y. The algorithm now repeats the following sampling step AI hi3 times: sample component 
K E K, independently with probability yx/M. In G, contract if's cheapest edge (the so-called loss 
of K), and continue. Let G' be the final contracted graph, and let S be the set of centre vertices of the 
M In 3 sampled full components. The algorithm now returns a minimum-cost tree spanning the terminals 
R, and the set S. 

Chakxabarty et al. showed that the expected cost of the returned solution is no more than 1.28 times 
the value of the initial ( |DCR| l solution. We observe here that when it comes to quasi-bipartite graphs, 
the above process that iteratively samples components, can be alternatively interpreted as sampling their 
centers. Each Steiner vertex ends up in set S with a certain probability, and this distribution can be 
realized alternatively by sampling directly from a ( |BCR[ ) solution. 

Lemma 15. Let y be a solution to ( |DCR| l for a given quasi-bipartite Steiner tree instance, and let 
X = '^{y). Then in any iteration of CKP, the probability of choosing a component with center v is 
exactly x {S^{v)) /M. 

Proof. Consider a Steiner vertex v, and let /C„ be the set of full components that have v as their centre. 
The definition of <f> immediately shows that 



x{5+{v))= yK. 



This obviously implies the lemma as the right-hand side of the above equality, scaled by A/, is the 
probability that a component with centre v is sampled. □ 

Consequently, we also have M = \^y = Yl,vev\R x{5^{v)). We can now simulate Algorithm CKP 
using the optimal solution of ( |BCR| l. 

Algorithm 16. CKP2 

Require: x is an optimal basic feasible solution of ( |BCR| l, and M = Tlivev\B. ^ ('^^(^))- 
for i 1 il/lnSdo 

Sample a Steiner vertex v with probability jjx {5^{v)). 
end for 

return a minimum spanning tree on the terminals and the sampled Steiner vertices. 
To complete our argument, if we run Algorithm 16 on a minimal ( BCR| l solution x, then by Lemma 15 



the expected cost agrees with that of Algorithm CKP run on the ( |DCR| l solution y that is obtained by 
decomposing x. Both x and y are optimal for ( |BCR[ ) and ( |DCR| l respectively, so our claim follows. 
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Appendix 

Proof of Theorem |3] 

First we need to consider the dual (DCRy?) of (DCR/ I. To slightly simplify notation, we will denote 
f{U) also by fu, so that (DCRj ) reads as follows. 

max /^z (DCR^) 
s.t. z (A+ ) <CK VKeJC 
zu >0 V valid UCR 

To prove the theorem, consider z to be feasible to (DCRj*) with non-negative costs c. What we show 
next is that 

fz < c^x (5) 



is valid for (BCR/ I. We point here that we may assume, without loss of generality, that there are no arcs 
between terminals; this may be accomplished by splitting such arcs into two, putting a non-terminal in 
between. Since we are in the quasi-bipartite case, there are no arcs between non-terminals either. 

Before we proceed with the proof of (|5]l, we need to show that we may also assume that the solution 
z of (DCR^) enjoys a nice structural property. 

Lemma 17. Every optimal solution of(DCR^) has laminar support. 

Proof. Our proof is by contradiction. Let z be an optimal solution to (DCR^ ) that maximizes J2ucr\^\'^^u- 
If the support of z is not laminar, there must exist two intersecting subsets S, T of the terminals with 
zs,zt > 0. So let e = min {zs,zt} and define 

z' := z + e {esnT + esuT ~ es - ex) , 

and note that z' > 0. Now we claim that z' is an optimal solution to (DCR^'). Indeed, for each K E K,, 
submodularity of and the fact that z is feasible in (DCR^*) imply 

z' (A+) = z (A+) + e (A+ (5 n T) + A+{S U T) - A+{S) A+ (T)) < fu. 
Thus, z' is feasible in (DCR^'). Moreover, intersecting supermodularity of / implies 

f^z' ^f^z + e ifsnr + bsuT - fs - fr) > f^z, 

which proves optimality for z' . Note then that a; M- is strictly convex and jS* n Tj < minllS*!, |r|}. 
The contradiction then (assuming the non-laminarity of z) is that 

J2 E \U\'zu + ^{\SnT\' + \SUT\^^\S\'^\T\') > ^ \U\'zu- 

UCR UCR UCR 

□ 
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We are now ready to start the proof of (|5]l. Our argument uses induction on | supp(z) |. 

The base case of our induction is simple since if | supp(z)| = 0, we have z = 0. But a:: > is valid 
for ( |BCR/| and so is c^x>0 = f'^z. 

Now suppose \supp{z)\ > 1. Let Ti, . . . ,Tk be the inclusion-wise maximal sets in supp(z), as 
they follow from Lemma 17 Since supp(z) is laminar, Ti, . . . .T^ are disjoint. Next we distinguish the 
cases k — 1 and k > 2, and for each of them (building on the inductive argument) we conclude that 
/^z < c^x is valid. In both cases below we denote hy N ^ V \ R the set of non-terminals. 

(The case A; = 1): The laminar family has one element T = Ti. For each v ^ N we define 

r„ := min {c„i, : vu ^ A, u ^ R, u ^ T} U {+00} . 

Next, order the elements of N as vi, . . . ,vg such that t„j < Tj,^ < • • • < t^^. Let tQ = and tg^i = zt, 
and for each 1 < i <£, let ti = min {Ty. , zt}- For each 1 < i < ^ + 1, we also define 

:= iU - U^ijer 
c* := {ti - i^-l)xs+(T\J{vi,...,vl})^ 
Ow: next claim is that /^z' < {c^ Y'x is valid for ( BCR / 1. Indeed, note that the inequality at hand is just 
a scaling (by ti — i^-i) of the ( |BCR/| l inequality 

/t < ^ (5+„(T U {w,, . . . , vi})xuv 

uv^A 

and thus /^z' < (c*)^x must be valid for (BCR/J. Next we define 

e+i e+i 
z':=z-'^z\ and c':=c-^c'. 

i=l i=l 

and we note that 

I _ { Cy-u-tj , if u e R\T 

~ 1 Cyy^-ZT + tj , if ueT 

since J2t=i ^vju = *j when u £ R\T, while ^uv, — zt — tj when u eT. 

The first important observation then is that by the definition of z' we have 

e+i e+i 

z = z' + ^z'^ = z' + ^{ti - ti_i)eT = z' + {te+i - to)eT = z' + ztBt- 

i=l 1=1 

This means that | supp(2')| = | supp(z)| — 1. In what follows we show that (i) c' can be thought as a 
non-negative cost function (see Claimllli and (ii) that z' is feasible to (DCRy?) with cost c' (see Claim 
Note that (i),(ii), along with the observation that | supp(2:')| — \ supp(z)| — 1 show that z' < {c') x 
is valid for (BCR/ 1 (due to the inductive hypothesis). This allows us to conclude that also the inequality 

b^{z' + z' + --- + < (c' + + • • • + c'+^fx, 

is valid for (BCR/J . The latter inequality is just /^z < c^x, which completes the inductive argument, 
and the case k ~ \. 

Thus it remains to argue formally about (i),(ii) above. 

Claim 1. c' as defined above is non-negative. 

Proof. First take vu ^ A with v ^ N and u £ R. If u £ T, then — > 0. Otherwise, let 
1 < j < ^ be such that Vj — v. But then 

^VU ^VU tj Cyy^ min {t\,, Z^^} ^ ^ Cyu Cyii — 0. 
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For the other case, take uv E A with u £ R and v E N. If u ^ T, then c'^^ — c^v > 0. Otherwise, let 
1 < j < ^ be such that vj — v. 

If tj = zt, then c^y — c„^ — zt + tj > 0. On the other hand, if tj ~ Ty, then let w e i? \ T be such 
that vw G A and = c^w Then the component K with source tt, sink w, and non-terminal w crosses 
T, so by feasibility of z, we have 

> ^ A+ (J7)zc/ > ZT, 

which implies that = + c„u, — zt > 0. □ 

Finally, feasibility of z' in the dual of (DCR/J with cost function c' is given by the next claim. 
Claim 2. z' is feasible to (DCR^ ) with costs c'. 

Proof. First, note that z' > follows from its definition. Now take a component K with non-terminal 
u € F \ i? and sink u E R. Let 1 < j < ^ be such that vj = v. 

Suppose first that K does not cross T. If sources (if) n T = 0, then A+ ([/) = whenever U C R 
and Z(7 > 0. Therefore, since by Claim[T]we have c' > 0, we conclude that 

J2A+iU)z'^ = 0<c'iK). 

UCR 

On the other hand, suppose sources (iC) n T 7^ but u E T. If tj = zt, we have 

i+i e+i 

^c'(X)== ^ (<, -ii_i)|sources(if)nr| =0. 

Now, since K does not cross T, the above implies that 

c\K) = c{K) > A+ (C/)z^ ^ ^k{U)z'jj. 

UCR UCR 



If tj = Ty, let u' E R\T be such that Ty = Cyyi and let W = sources(-ft') n T. By summing the 



inequalities of (DCR?) corresponding to components of the form wvv! for each vj E W, we have 



«)6W weW UCR 

i7C_R wGW 



> Y(T.^^umzu + \w\zT 

UCR wew 

> J2^KiU)zu + \W\zT. 



UCR 

Therefore 

> Y ^i(U)zu - \W\cyu' + \W\zT - \W\{zT - tj) = Y ^UU)zu, 

UCR UCR 

as required. 

Now suppose K crosses T. Let W — somces{K) OT , and let K' be the sub-component of K having 
sources W. We have 

c'iK) > c'{K') = Y ""'^v + C = E - + ^:') + ^™ " 
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When tj = zt, we recall that K crosses T, and that T is outermost, hence 

c'iK) > c{K') -ZT>J2 ^UU)zu -ZT=Y1 ^ii'^K- 

UCR UCR 

In the final case where tj = t„, let «' e R\T achieve the maximum in the definition of r„, and let 
W = sources(ii') fl T. Then, again c' is non-negative because 

c'{K) > {Cyjy - Zt + Cyu') + Cvu - Cvu' > ^ {Cwv + Cvu') - \W\zt 

> E (E ^L'iU)zu) - \W\ZT = E ( E ^L'iUWu 
wew UCR UCR wew 



> Y.^UU)z'u^Y.^k(U)z'u. 



UCR UCR 

□ 

(The case fc > 2): Recall that by = y \ i? we denote the set of non-terminals. As in the previous 
case, we define non-negative cost function c' along with z' of smaller support so as to use the inductive 

hypothesis. For each 1 < i < fc, we define now c* € as follows. 
For each uv & A with u G R and v G N,we set 

^. _ \cuv ifu e Ti 
1 otherwise, 

while for each vu G A with v G N and u e i? we define 



Cyu ■= max 



\ --W CTi such that wv G AWw gW j ' 



Our first claim is that > 0, for every 1 < i < fc. The reason is that if uv G A with u G R and 
V G N,we clearly have c^^ > 0. If on the other hand vu G A with v G N and u G R, just take W = % 
in the definition of cj,^ to get > 0. 

Our next claim is that c dominates the sum of c*'s, namely 

-h • • • c'= < c. (6) 

To see why, let uv G A with u G R and v G N. Since Tj's are disjoint, we have that ci„ -I hc^^ < 

Cuv Next take vu G A with v G N and u G R. For each 1 < i < /c, let VFi C Tj achieve the maximum 
in the definition of c*„. Consider the component K with sources W := Wi U • • • U Wk, non-terminal v, 
and sink u. Since 2; is feasible to (DCRj?) with costs c, we have 

c{K) > ^ zu^f: E -- = E E 

U: UCR,A+{U) = 1 »=1 i7: (7CTi ,A+ (i7)=l »=1 t^: i7CTi,i7nW'i7t0,u5^i7 

k k 

i=i weWi i=i wew 1=1 



The latter implies that 



as we claimed. 



k 
i=l 
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Next, similarly to the case fc = 1, we define e for each 1 < i < fc as follows. For U C R we 

set 

, _ jzu ifC/CT, 
I otherwise. 

From our definition, it is immediate that Yli=i — ^- Analogously to Claimji] we show again that 
Claim 3. For 1 < i < k, vector is feasible to (DCR^) with costs d 

Proof. Indeed, consider a component K with non-terminal v E N and sink u E R. Then, by setting 
W = sources(i4r) n T; in the definition of c* we obtain 

C (K) = C„„ + y ^ C^,^, — C^^ + ^ ^ Cujt; 

tDesourccs(i<') w£W 
U: UCTi,UnW^<i,u^U U: UCR,Unsouiccs{K)^li,u^U 

UCR 

Since > 0, this shows is feasible to (DCRy*) with costs c\ □ 

Since we are dealing with the case fc > 2, we have | supp(z')| < | supp(z)|, for all 1 < j < fc, so by 
induction, ti^z^ < {d)'^x is valid for (BCR/ i for each 1 <i <k. 
By summing over all I's, we get that 

/^z = /T(zi + . . . + ^fe) < (ci + . . . + c'-O^x < c^x 
is also valid for (BCR/ i which completes the inductive proof. Altogether, this justifies (|5]l. 
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